P706097GB



Height-related Gene 



The sex-related height difference in humans is thought to be caused mainly by two
components: first, a hormonal component determined by the sex dimorphism of bioactive
gonadal steroids and second, a genetic component attributed to a Y-specific growth gene,
termed GCY (Tanner et al. 1966; Smith et al. 1985; Ogata and Matsuo 1992). Despite
extensive mapping attempts for this gene on the human Y chromosome (Ogata et al. 1995;
Salo et al. 1995; Rousseaux-Prevost et al. 1996; De Rosa et al. 1997), its precise position
remains unknown. Recent evidence shows that inappropriate cytogenetic methodology in the
characterization of Y-chromosomal terminal deletions has brought about some of the
difficulties in elucidating the GCY critical region. In order to overcome these problems, the
inventors have considered only patients presenting de novo interstitial deletions for the GCY
analysis on the Y chromosome (Kirsch et al. 2000). This approach allows the assignment of
GCY to a particular chromosomal interval without having to exclude the presence of
XO-mosaicism and/or i(Yp) and idic(Yq11) chromosomes in patients with terminal deletions.

The direct comparison of overlapping interstitial deletions in seven adult males with normal
height, one male with borderline height, and one patient with a large interstitial deletion and
short stature resulted in the confirmation of the GCY critical interval between markers DYZ3
and DYS11. This region encompasses roughly 1.6-1.7 Mb of genomic DNA. To improve the
resolution in the region of interest close to the centromere, the inventors have established
additional new STS markers specific for this part of the chromosome using their bacterial
artificial chromosome (BAC)/P1-derived artificial chromosome (PAC) contig. Molecular
deletion analysis using these new Y-chromosomal STSs allowed the inventors to narrow
down the critical interval to a genomic region of 700 kb.

The invention provides an isolated region of the Y chromosome between DYZ3 and DYS11
which encompasses GCY. Preferably the Y chromosome is a human Y chromosome.

The invention further provides isolated gene/pseudogene sequences which contribute to the
sex-related height difference in humans.

The invention further encompasses proteins having the same function as GCY protein and
which have greater than 65% homology, greater than 70% homology, greater than 75%
homology, greater than 80% homology, greater than 85% homology, preferably greater than
90% homology, and most preferably greater than 95% homology to the GCY protein.

Experimental evidence will now be described in detail with reference to the tables and
figures, in which:

Table 1 is a comparison of the adult height of patients and their siblings.

Table 2 is a table of new Y-chromosomal STSs.

Table 3 is the PCR/restriction digest analysis of sequence family variants in the AZFc region.

Table 4 is a summary of BAC and PAC clones identified during physical map creation.

Table 5 is a summary of the genomic primers that will be used for microdeletion screening
in adult males with idiopathic short stature.

Table 6 is a summary of the sequences of the isolated exon trap clones.

Table 7A is a summary of primer pairs for predicted genes.

Table 7B is a summary of primer pairs specific for the Y-copy of Adlican.



Figure 1. Deletion mapping on the long arm of the human Y chromosome. 

A diagram of the human Y chromosome with Yp telomere to the left and Yq telomere to the 
right is presented at the top. Shown below are the results of low-resolution analysis of Y
chromosomes of adult males with normal height or short stature. Along the top border, 95
Y-chromosomal STSs are listed. Except for SKY3 and SKY8 (see Table 2 for detail), all
STSs were previously reported (Vollrath et al. 1992; Jones et al. 1994; Reijo et al. 1995).
Blank spaces or grey boxes indicate inferred absence or presence of markers for which assay
was not performed. Asterisks indicate markers in the respective breakpoint regions which
could not be tested. In all cases where previously published data of the patients were
re-investigated, the identical DNA sample used for the primary analysis was studied. (Please
note that the proximal as well as the distal breakpoint of the interstitial deletion of patient
#293 resides within satellite type n sequences.)

Figure 2. Sequence family variant (SFV) typing in the human DAZ locus in distal
Yq11.23.

A. Overview and amplicon structure of the human Y chromosome in the vicinity of the
human DAZ cluster. Each amplicon is represented by specific bands (A, B, D, E, X). Shown
above are arrows indicating the orientation of each member of an amplicon family with
respect to each other. The amplicon indicated by bands X arose from a portion of
chromosome 1 that was transposed to the distal end of the DAZ cluster and partially
duplicated.

B. Precise position of selected Y-specific STSs and the SFVs according to the physical map
of the human Y chromosome. Marker sY157 is highlighted as it was suspected to be present
in only one copy by multiplex PCR analysis (see text for detail).

C. Summary of STS and SFV analysis in patients with Y-chromosomal rearrangements
within the human DAZ cluster region. Grey boxes indicate inferred absence or presence of
markers.

D. Sequence family variant typing of SKY10 and SKY12 in genomic DNA of patient #1972.
Assay is described in Table 3. Along the right are listed fragment sizes (in bp). Products are
separated by electrophoresis in 3% NuSieve agarose (3:1) and visualized by ethidium bromide
staining.

Figure 3. Schematic representation of the organization of the long arm
pericentromeric region of the human Y chromosome

A. Diagram showing the distribution of major tandem repeat blocks and general organization
of sequence homologies. Basically, the region can be subdivided in three distinct intervals: a
proximal region characterized by 5bp satellite sequences (G), a central region with high
homology to chromosome 1 (O), and a distal region composed of X/Y-homologous sequences
(B). Below, the precise positions of the newly established and previously published STS
markers in this region are illustrated. At the bottom border, the PAC/BAC contig constructed
with the aid of the new STS markers is shown. Prefixes RP1, 5 indicate PAC clones and RP11
BAC clones, respectively.

B. Localization of the GCY critical interval as defined by high-resolution STS mapping in
patients with short stature and normal height. Black boxes indicate the presence, white boxes
the absence of the respective STS. Striped boxes depict the dosage-unknown regions where
the breakpoint resides.



Figure 4. Genomic properties of the minimal GCY critical region 

4A. Diagram presenting an enlarged view of the GCY critical region. Within the
homology-specific coloured genomic subintervals the PAC and BAC clones covering the minimal GCY
critical region are shown against "Mapped PACs/BACs". Corresponding BAC clones
sequenced by the Human Genome Project are shown against "Sequenced BAC clones"
beneath. The bottom layer ("Genomic Primers") indicates the genomic distribution of the
primer pairs used for microdeletion screening of adult males with idiopathic short stature.

4B. Summary of all potentially functional sequences within the GCY critical region. For
ease of overview, the genomic location of the sequenced BAC clones is repeated at the top
layer. Shown below are the results of the gene prediction analysis. Precise position,
orientation, and exon/intron structure of each of the potential transcriptional units is indicated.
Promoter predictions obtained by the program FirstEF are presented beneath. The next layer
summarizes in a similar fashion the properties of 4 apparent pseudogenes detected in this
genomic region by BLAST homology searches. For completion, ARSF and RPS24Y are
added. The bottom border presents the genomic location of the isolated exon trap clones.



Materials and Methods 



Defining the GCY critical region 



Selection of patients 

Patients #293, JOLAR, #28, #63 and #95 have been described clinically in detail elsewhere
[...] corresponds to case 1 in the study of Pryor et al. 1997. Patients T.M., #1947 and #1972 are
phenotypically normal males suffering from idiopathic infertility. Genomic DNA samples
were extracted from peripheral blood leukocytes (#28, #63, #95, Y0308, T.M., #1947, #1972)
or from lymphoblastoid cell lines (#293, JOLAR). DNA isolated from peripheral blood
leukocytes of normal males and females served as internal controls.

Height assessment 

As all individuals are of diverse ethnic origins, height was compared to the respective national
height standards (Table 1). Patients were of similar age range. When possible, special
attention was given to adult height comparisons between parents and siblings. Data are
summarized along with the height standard deviation score (SDS) in Table 1. To calculate the
SDS, mean adult height and the standard deviation were taken from the corresponding
national physical growth studies.
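The SDS calculation described above can be sketched as follows. This is a minimal illustration, assuming the usual definition of a standard deviation score (patient height minus reference mean, divided by the reference standard deviation); the function name is hypothetical, and the reference values are the U.S. figures listed in Table 1.

```python
def height_sds(height_cm: float, mean_cm: float, sd_cm: float) -> float:
    """Standard deviation score: distance from the reference mean in SD units."""
    if sd_cm <= 0:
        raise ValueError("sd_cm must be positive")
    return (height_cm - mean_cm) / sd_cm


# U.S. reference values as listed in Table 1 (mean 176.9 cm, SD 6.8 cm).
US_MEAN_CM, US_SD_CM = 176.9, 6.8

for case, height in [("#293", 157.0), ("Y0308", 165.5)]:
    # Rounded to one decimal, as the scores are reported in Table 1.
    print(case, round(height_sds(height, US_MEAN_CM, US_SD_CM), 1))
```

With these inputs the rounded scores are -2.9 for patient #293 and -1.7 for patient Y0308, matching the values reported in Table 1.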

PCR analysis

Reactions were performed in a total volume of 50 µl (75 mM Tris/HCl pH 9.0, 20 mM
(NH4)2SO4, 0.1% (w/v) Tween 20, 1.5 mM MgCl2) containing 1.0 mM of each oligonucleotide primer,
100 ng genomic DNA as template, 5 units of Taq DNA polymerase (Eurogentec), and each
dNTP at 1 mM in a thermocycler (MJ Research, Inc.) as follows: after an initial denaturation
step of 95°C for 5 min, samples were subjected to 30 cycles consisting of 30 sec at 94°C, 30 sec
at 60°C and 1 min at 72°C, followed by a final extension step of 5 min at 72°C. The multiplex
PCR was carried out as described in Henegariu et al. 1994 with minor modifications. Alu-Alu
PCR reactions were essentially carried out as described in Nelson et al. 1991. Amplification
products smaller than 1 kb were resolved on 3% NuSieve agarose/1% SeaKem GTG agarose
(FMC) in 1 x TBE (0.089 M Tris-borate/0.089 M boric acid/20 mM EDTA, pH 8.0). For
amplification products larger than 1 kb as well as products from Alu-Alu PCR, 1.5% SeaKem
GTG agarose gels in 1 x TBE were used for separation.
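For orientation, the standard cycling profile and the gel choice can be written down as plain data. The sketch below only encodes the hold times and temperatures as read from the protocol above; the class and function names are hypothetical, ramping time between temperatures is ignored, and the handling of a product of exactly 1 kb is an assumption because the text does not specify it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float
    seconds: int


# Cycling profile as read from the protocol text.
INITIAL_DENATURATION = Step(95, 5 * 60)
CYCLE = (Step(94, 30), Step(60, 30), Step(72, 60))
N_CYCLES = 30
FINAL_EXTENSION = Step(72, 5 * 60)


def total_hold_seconds() -> int:
    """Sum of programmed hold times; ramping between temperatures is ignored."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return INITIAL_DENATURATION.seconds + N_CYCLES * per_cycle + FINAL_EXTENSION.seconds


def gel_for_product(size_bp: int, alu_alu: bool = False) -> str:
    """Gel used for separation, following the two cases given in the text.

    Assumption: a product of exactly 1000 bp is treated like the larger class.
    """
    if alu_alu or size_bp >= 1000:
        return "1.5% SeaKem GTG agarose"
    return "3% NuSieve / 1% SeaKem GTG agarose"


print(total_hold_seconds() / 60, "min of programmed hold time")
print(gel_for_product(350))
print(gel_for_product(2400))
```

The programmed hold time comes to 70 minutes (5 min, then 30 cycles of 2 min, then 5 min); actual run time on a given instrument will be longer because of ramping.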

PCR primers

Y-specific STSs, loci and PCR conditions have been described previously (Vollrath et al.
1992; Jones et al. 1994; Reijo et al. 1995). Sequences of new Y-chromosomal STSs are listed
in Table 2. Y-specific STSs termed SKY were either derived from YAC, BAC and PAC end
sequences or from clone-internal sequences amplified by various combinations of Alu
primers. Primers for the markers SKY10, 11, 12, and 13 were designed to amplify fragments
spanning unique restriction sites within the genomic DAZ locus (SKY10 from RP11-487K20
(AC024067), RP11-70G12 (AC006983), RP11-141N04 (AC008272), RP11-366C06
(AC015973), RP11-560I18 (AC053522), RP11-175B09 (AL359453); SKY11 and SKY12
from RP11-245K04 (AC007965), RP11-100J21 (AC017005), RP11-506M09 (AC016752),
RP11-589P14 (AC025246); and SKY13 from RP11-100J21 (AC017005), RP11-589P14
(AC025246), RP11-823D08 (AC073649), RP11-251M08 (AC010682), RP11-978G18
(AC073893)) in order to detect 'sequence family variants' (SFVs).

Restriction analysis of PCR products

PCR products were resolved on agarose gels, the appropriate gel bands cut out and the DNA
isolated with the GFX™ PCR DNA and Gel Band Purification Kit
(Amersham Pharmacia Biotech, Inc.) according to the manufacturer's protocol. Fragments
amplified from SKY5 and SKY6 were digested with TaqI and BsmI, respectively. To detect
SFVs at SKY10, SKY11, SKY12 and SKY13, PCR products were digested with restriction
enzymes as listed in Table 3.
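The principle behind a PCR/restriction digestion assay for sequence family variants can be illustrated with a small in silico digest. This is a sketch only: the two amplicon sequences are invented placeholders and not the SKY amplicons, the function name is hypothetical, and the only enzyme fact relied on is the MseI recognition site TTAA with the cut after the first T.

```python
def digest(seq: str, site: str, cut_offset: int) -> list[int]:
    """Fragment lengths after cutting a linear sequence at every recognition site.

    cut_offset is the number of bases of the site that stay on the left of the
    cut on the top strand (1 for MseI, T^TAA).
    """
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:]) if right > left]


# Hypothetical 40 bp amplicons from two copies of a repeated locus: one copy
# carries an MseI site, the other has a single base change that destroys it.
COPY_WITH_SITE = "ACGT" * 5 + "TTAA" + "GCGC" * 4
COPY_WITHOUT_SITE = "ACGT" * 5 + "TTGA" + "GCGC" * 4

print(digest(COPY_WITH_SITE, "TTAA", 1))     # two fragments
print(digest(COPY_WITHOUT_SITE, "TTAA", 1))  # uncut product
```

A sample containing both copies would show the uncut band together with the two digestion fragments, whereas loss of one copy leaves only one of the two patterns; this is the kind of read-out used for SFV typing in the text.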

Sequencing of BAC/PAC/YAC end fragments

DNA from BAC/PAC clones selected for end sequencing was purified with the Nucleobond
PC100 Kit (Macherey-Nagel) according to the manufacturer's instructions. End fragments
were directly sequenced using the Thermosequenase Fluorescent Labelled Primer Cycle
Sequencing Kit (Pharmacia) and analyzed on a Pharmacia A.L.F. express (Amersham
Pharmacia Biotech). YAC end fragments were generated with Alu/vector polymerase chain
reaction and subcloned in pCR2.1 with the TOPO-TA Cloning Kit (Invitrogen). Sequencing
was performed as described.

Fluorescence in situ hybridization

Metaphase spreads were obtained either from primary blood samples or immortalized cell
lines. Preparations were made according to standard protocols (Lichter and Cremer 1992).
Cosmid and plasmid DNA was labeled by nick translation with biotin-16-dUTP (La Roche).
[...] of labeled plasmid or cosmid DNA, 20-30 µg of human Cot-1 DNA (GIBCO BRL), and
hybridization buffer (50% formamide, 10% dextran sulfate, and 2 x SSC, pH 7.0) were
mixed, denatured for 5 min at 75°C and pre-annealed for 30 min at 37°C. The slides were
denatured for 2 min in 70% formamide and 2 x SSC, pH 7.0, at 72°C (Ried et al. 1992). The
pre-annealed probe was hybridized overnight in a humidified chamber at 37°C. Slides were
washed and stained with avidin-conjugated fluorescein isothiocyanate (FITC). The signal was
amplified with biotinylated anti-avidin followed by staining with avidin-FITC. For the probe
'all human telomeres' (Oncor) the instructions supplied by the manufacturer were followed.
Chromosomes were counterstained with 4',6-diamidino-2-phenylindole dihydrochloride
(DAPI). Images were taken separately by using a cooled charge-coupled device camera
system (Photometrics, Tucson, AZ, USA). A Macintosh Quadra 900 was used for camera
control and digital image acquisition in the TIFF format using the software package Nu200 2.0
(Photometrics). Separate gray scale fluorescence images were recorded for each
fluorochrome. Images were overlaid electronically and further processed using the Adobe
Photoshop software.

Searching the stature gene 

Microdeletion screening

Exon amplification

Shotgun subcloning of PAC clones into pSPL3B. Genomic DNA from chromosome Y
specific PAC clones was partially digested with Sau3AI. 100 ng of isolated fragments in the
range of 4-10 kb were ligated with 100 ng of pSPL3B that had been BamHI digested and
dephosphorylated. The ligation reaction was transformed into supercompetent E. coli XL1-Blue
cells (Stratagene) and aliquots of each transformation plated on selective medium
(ampicillin). Resulting colonies were subsequently pooled for plasmid DNA isolation.

Cell culture and electroporation. COS7 cells were propagated in DME medium supplemented
with 10% heat-inactivated calf serum. For transfections, COS7 cells in between the 5th and 15th
passage were grown to about 75% confluence, trypsinized, collected by centrifugation and
washed in ice-cold Dulbecco's PBS. 4x10^ cells were then resuspended in 0.7 ml cold
Dulbecco's PBS and combined in a precooled electroporation cuvette (0.4 cm chamber,
BioRad) with 0.1 ml Dulbecco's PBS containing 15 µg DNA. After 10 min on ice, cells were
gently resuspended, electroporated (1.2 kV, 25 µF) in a BioRad Gene Pulser 2 and placed on
ice again. After 10 min cells were transferred to a tissue culture dish (100 mm) with
prewarmed, CO2 preequilibrated culture medium.

RNA isolation, RT-PCR and cloning. Cytoplasmic RNA was isolated 72 hrs post transfection
(QIAGEN RNeasy Kit) and first strand synthesis was performed as recommended by the
manufacturer with minor modifications: 5 µg of RNA was added to a solution containing
10 mM of each dNTP and 2 µl of oligonucleotide SA2. The mixture was heated to 65°C for
5 min and then placed on ice for at least a further minute. After adding a reaction mixture
containing 10x PCR buffer (Perkin-Elmer Cetus), 25 mM MgCl2, 0.1 M DTT and RNasin
(35 U/µl), the reverse transcription reaction was transferred to 42°C for 2 min. 1 µl of
SuperScript II RT (200 U/µl; Gibco BRL) was then added and the reaction incubated at 42°C
for 90 min and 50°C for 30 min. The entire cDNA synthesis reaction was then converted to
double-stranded DNA using a limited number of PCR amplification cycles in the following
100 µl reaction mixture: 1x PCR buffer (Perkin-Elmer Cetus), 1.5 mM MgCl2, 200 µM dNTPs,
1 µM SA2, 1 µM SD6 and 2.5 U Taq polymerase (Perkin-Elmer Cetus). 6 amplification cycles
were used and consisted of 1 min at 94°C, 1 min at 60°C and 5 min at 72°C. To eliminate
vector-only and false positive products, 50 U of BstXI (New England Biolabs) was added
directly to the reactions, followed by overnight incubation at 55°C.

10 µl of the digest was then used in a second PCR amplification using internal primers in the
following 100 µl reaction mixture: 1x PCR buffer (Perkin-Elmer Cetus), 1.5 mM MgCl2,
200 µM dNTPs, 1 µM (CAU)4-SD2, 1 µM (CUA)4-SA4 and 2.5 U Taq polymerase
(Perkin-Elmer Cetus). 25 amplification cycles were used and consisted of 1 min at 94°C, 1 min at 60°C
and 3 min at 72°C. Products were separated by electrophoresis and fragments larger than the
pure SD2/SA4 RT-PCR product excised and subcloned (CloneAmp pAMP1 System; Gibco
BRL) into pAMP1 according to the manufacturer's protocol. Ligation reactions were then
transformed in ultracompetent E. coli XL2-Blue (Stratagene) and plated on selective medium
containing X-Gal/IPTG.




Identification of candidate exons. All white colonies were picked and transferred to 384-well
microtiter plates containing selective medium and incubated overnight at 37°C. With a
384-pin transfer device, 24.5 x 24.5 cm culture plates with and without positively charged nylon
membranes (Amersham) on top of them were inoculated and also incubated overnight at
37°C. Colonies grown on culture plates were pooled for plasmid preparation; colonies on
nylon membranes were used for colony lifts. Plasmid inserts were excised, purified, and
hybridized to nylon membranes containing EcoRI digests of the PAC clones used as the
original substrate. Hybridizing bands were subsequently isolated and hybridized to colony
lifts to identify candidate exons. Candidate exons were isolated and sequenced with the SequiTherm
EXCEL II DNA Sequencing Kit (Epicentre Technologies). Sequences were automatically
analyzed and read on an ALFexpress DNA sequencer. Table 6 lists the sequences of the
isolated exon trap clones.

In silico analysis

The genomic sequence was analyzed with the NIX (Hinxton Hall, Cambridge, UK) and
RUMMAGE (IMB, Jena, Germany) analysis software packages. A total of 15 potential gene
models were proposed and homologies to 6 known genes were found.

Results 

Mapping of interstitial deletions

We studied the DNA of nine adult males who originally consulted reproduction centers
about idiopathic infertility, but were otherwise generally healthy. Of the 9 males, 7 were
unremarkable with respect to adult height. One patient, #293, with a height of 157 cm,
presented short stature (SDS -2.9) and one, Y0308, with a height of 165.5 cm showed
borderline height, being at the 3rd percentile of the normal U.S. height standard (SDS -1.7). The
adult heights of his parents and siblings are in the normal range (Table 1), his brother being 20.5 cm
taller than the patient. Compared to his target height (178 cm) and target range (169-187 cm)
he can be considered short. All men were ascertained solely on the basis of the occurrence of
large de novo interstitial deletions on the Y chromosome. Only two of those patients had
undergone previous chromosomal studies.




In our effort to localize the GCY locus, we focused on that part of the Y chromosome long
arm which was delimited by the boundaries of the interstitial deletions of the patients with
short stature (Fig. 1). Recently, a detailed physical map of the human Y chromosome
incorporating 758 ordered STSs and 199 completely sequenced BAC clones has been
constructed (Tilford et al. 2001). We used a slightly modified PCR multiplex system
(Henegariu et al. 1994) to test the absence or presence of 28 DNA loci from the Y
chromosome long arm. In patients where sufficient DNA was available for further PCR
analysis, additional STSs were tested. As a result, 8 of 9 interstitial deletion breakpoints could
be positioned (Fig. 1). As the deletions of patients JOLAR, #28, #63, #95, T.M., and #1947,
all with normal height, overlap, most of the long arm of the Y chromosome could be excluded
as a critical region for GCY.
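The exclusion logic used here can be sketched in a few lines: a marker remains a candidate only if it is missing in every short-stature patient and present in every normal-height patient. The marker names, patient labels and deletion sets below are hypothetical placeholders chosen for illustration, not the STS data of Fig. 1, and the function name is likewise an assumption.

```python
def candidate_markers(
    ordered_markers: list[str],
    deleted: dict[str, set[str]],
    short_stature: set[str],
) -> list[str]:
    """Markers deleted in all short-stature patients and in no normal-height patient."""
    # Anything missing in a man of normal height cannot be required for normal growth.
    excluded: set[str] = set()
    for patient, lost in deleted.items():
        if patient not in short_stature:
            excluded |= lost

    # The critical interval must lie inside every short-stature deletion.
    shared = None
    for patient in short_stature:
        lost = deleted[patient]
        shared = set(lost) if shared is None else shared & lost
    shared = shared or set()

    return [m for m in ordered_markers if m in shared and m not in excluded]


# Hypothetical example: markers M1..M8 ordered from centromere to telomere.
MARKERS = [f"M{i}" for i in range(1, 9)]
DELETED = {
    "short_1": {"M2", "M3", "M4", "M5", "M6"},
    "normal_1": {"M4", "M5", "M6", "M7", "M8"},
    "normal_2": {"M5", "M6", "M7"},
}

print(candidate_markers(MARKERS, DELETED, short_stature={"short_1"}))
```

In this toy data set only M2 and M3 survive, which mirrors how overlapping deletions in men of normal height remove most of the long arm from consideration.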

As the distal breakpoint of the deletion of patient #1972 does not reside within the specific
part of the Y chromosome long arm, the nature of the deletion (terminal or interstitial)
remained unclear. There was also no overlap of his deletion with the deletions of patients
#1947 and T.M. Relying solely on the results obtained by the STS-based interstitial deletion
mapping strategy, one could not formally exclude the region distal to sY158 as a potential
critical region for GCY. However, multiplex PCR analysis always showed a less intense
amplification product for STS sY157 (a Y-derived marker in close vicinity of sY158). To
address this problem, the rearranged Y chromosome of patient #1972 was investigated in
more detail.

Fluorescence in situ hybridization and sequence family variant typing of patient #1972

The overall integrity of the Y chromosome from patient #1972 was demonstrated by FISH of
the cosmids LLOYNC03"M"34F05 (PAR1) and LLOYNC03"M"49B02 (PAR2) as well as
the Y-centromere-specific probe Y-97 and the telomere-specific probe 'all human telomeres'
(data not shown). Being aware of the complex structural organization of the human DAZ
locus (Fig. 2A), we specifically searched for sequence family variants (SFVs). To prevent
misjudging sequence errors as single nucleotide differences, PCR/restriction-digestion assays
were developed only from SFVs present in at least two overlapping BAC clones. The
localization of these SFVs is shown in Fig. 2B. As these SFVs could represent allelic variants,
ten unrelated normal German males were typed. In all cases, the expected fragment pattern
could be detected for the Y-chromosome derived sequences. In contrast, the fragment pattern
deduced from the genomic sequence of the chromosome 1-derived BAC clone RP11-560I18
could not be confirmed (see Table 3 for detail). Each SFV-specific PCR/restriction digestion
was compared to the presence/absence in the corresponding BAC clones.

Typing the genomic DNA of patient #1972 for all four sequence family variants
(SKY10/Tsp509I, SKY11/NlaIII, SKY12/MseI, and SKY13/Cac8I + TfiI) revealed the
absence of one Y-derived non-allelic sequence variant (Table 3 and Fig. 2C, D). In the case of
SKY10 the distal copy is deleted. Not surprisingly, in all other typing experiments the more
proximal copy of the respective SFVs was shown to be deleted.

Next, we investigated these SFVs in the two patients with the most distal breakpoints (#95
and #1947). Using genomic DNAs, we determined that both non-allelic variants of SKY11,
SKY12, and SKY13 and one non-allelic variant of SKY10 were absent in patient #1947,
whereas for all tested SFVs one non-allelic variant was absent in patient #95.

Taken together, these results provide evidence that the proximal breakpoint of the interstitial
deletion present in the Y chromosome of patient #1972 resides within the interstitial deletion
of patient #1947, thereby excluding this genomic region as a potential critical interval for
GCY.

Refinement of the GCY critical interval

Based on the molecular analysis of the pericentric region of the long arm of the human Y
chromosome (Williams and Tyler-Smith 1997), the physical extension of the GCY critical
region as defined by the markers sY78 (DYZ3) and sY83 (DYS11) was estimated to
constitute 1.6-1.7 Mb (Fig. 3A) of DNA. The most proximal 400 kb of this region consist
exclusively of 5bp satellite sequences separated from the Y centromere only by Alu
sequences. This constant part of the human Y chromosome is therefore unlikely to contain
coding sequences. The remainder of the GCY critical region is composed of X/Y-homologous
as well as autosomal/Y-homologous sequence blocks. At the onset of this study, only limited
coverage in YAC clones was available for this region. In order to refine the GCY critical
interval and to generate gene finding substrates, it was necessary to establish a BAC/PAC
contig of this region.

We generated 25 additional markers mainly by sequencing the end fragments of BAC, PAC,
and YAC clones as well as clone-internal sequences amplified by various combinations of
Alu-Alu oligonucleotide primer pairs. Of those, only 7 turned out to be Y-specific (SKY1,
SKY2, and SKY4-8) (see Table 2 for detail). The BAC and PAC clones identified during the
generation of the physical map are summarized in Table 4. Meanwhile, some of these clones
have been completely sequenced as they form part of a tiling path for sequencing the human
Y chromosome (Tilford et al. 2001). The proximal part of the cloned region between markers
sY78 and SKY6 has not been sequenced to date. A selection of clones covering the entire
GCY critical region is depicted in Fig. 3.

Confirming the overlap between BAC RP11-295P22 and BAC RP11-322K23 appeared to be
the most crucial step in the process of contig construction. Y-specific markers derived from
the opposite end fragments of both clones were suspected to amplify identical-sized fragments
from two different loci within the same 5bp satellite region. By testing several restriction
enzymes known to cut frequently within 5bp satellites composed of the consensus sequence
(TGGAA)n, we developed locus-specific PCR/restriction digestion assays. Typing all BAC
clones mapping to this sequence block with the appropriate PCR/restriction digestion assay
allowed us to position them precisely therein, confirming their overlaps.

In order to narrow down the critical interval for the GCY gene, we tested for the presence of
the newly generated STSs in patients #293, Y0308, and JOLAR. These results allowed us to
define a small region for the GCY gene (Fig. 3B, 4A). Direct sequence comparison showed
that the sequenced BAC clones RP11-322K23, RP11-75F05, RP11-461H06, RP11-333E09,
RP11-558M10, CITB-298B15, and CITB-203M13 completely cover the mapped region
between Y-STSs SKY8 and sY83 (DYS11), suggesting that it encompasses roughly 700 kb.
Basically, the region can be subdivided in three distinct intervals: a proximal region
characterized by 5bp repeats, a central region with high homology to chromosome 1, and a
distal region composed of X/Y-homologous sequences. As the most distal part of the GCY
critical region (beginning with bp 1 of BAC clone CITB-144J01) was already subject of
extensive research during the process of characterization of the AZFa critical region and was
shown to harbour no functional gene (Sargent et al. 1999), it was excluded from further
detailed genomic DNA analysis. The most proximal part of the GCY critical region consists
exclusively of satellite type 3 sequences of the 5bp consensus (TGGAA)n and is therefore also
not assumed to contain any gene. Leaving these two regions out of consideration, we were
able to concentrate our efforts on a smaller interval of 420 kb of DNA. Large-scale sequence
comparisons performed with the Advanced PipMaker software showed no integration of
Y-specific sequences into the chromosome 1 and/or chromosome X-homologous regions.
We have also established new Y-specific markers scattered uniformly across the entire 420 kb
of DNA (Table 5).

Transcriptional potential of the GCY critical region 

To identify transcriptional units within the smaller 420 kb region, we performed exon
amplification on five PAC clones (RP1-148J07, RP5-1160A12, RP1-301P22, RP4-532I07,
RP1-114A11). Nine positive clones were isolated (Table 6). Direct sequence comparison with
BAC clones showed that two clones were each composed of two exons. None of the
clones shared homologies with a known gene or Y-specific ESTs.

At the same time, we performed in silico analysis with the BAC clone sequences provided by
the Human Genome Project. By the use of several gene prediction programs, a total of 15
gene models were proposed. We established Y-specific primer pairs for all predicted genes
(Table 7A). Furthermore, four apparent pseudogenes (KIAA1470P, ASSP6Y, ADLP, and
THC604695P) were identified by the BLAST alignment tool. To identify potential promoter
regions, sequences were submitted to the FirstEF program. Extracted data are summarized in
Fig. 4B.

The most prominent candidate gene so far seemed to be the Y-copy of Adlican, as its
X-homologous counterpart was shown to be upregulated in osteoarthritic tissue. Additionally, a
gene model was proposed for the Y-specific copy (cf.). Direct sequence comparison of the
X-derived transcript with Y-derived BAC clones revealed no conservation of splice sites and a
lack of exons 3 and 4 on the human Y chromosome. Nevertheless, splicing sites may have
experienced slight shifts on the human Y and exons 3 and 4 may not be essential for its
Y-specific function. A BLAST homology search against the EST database showed that all
Adlican-derived ESTs correspond to its X-chromosomal copy. RT-PCR showed no signs of
expression of the Y-derived copy so far in 16 polyA+ RNAs tested. Ubiquitous expression
was however detected of the X-derived copy, with the exception of all tested neurological tissues.
This may indicate that the Y-copy of Adlican represents a pseudogene. Alternatively, the
expression profile of the Y-gene/pseudogene may be restricted in time and space and thus not
detectable. A summary of primer pairs specific for the Y-copy of Adlican
is given in Table 7B.



Discussion 

Since the issue of the existence of a Y-specific growth gene (GCY) was first raised, there
have been several attempts to define its precise location. Whereas initial studies unanimously
pointed towards a common region of the Y chromosome long arm (Salo et al. 1995), more
recent investigations have led to the identification of two non-overlapping critical intervals
(Rousseaux-Prevost et al. 1996; Ogata et al. 1995; De Rosa et al. 1997). FISH analyses
resolved this apparent contradiction by presenting clear evidence that the patient materials
used in these initial investigations contained 45,X0 cells and/or i(Yp) or idic(Yq11)
chromosomes (Kirsch et al. 2000). Both genetic parameters influence the adult height of a
given individual, thereby rendering it impossible to predict whether such patients have lost
GCY or not. Studies with patients carrying de novo interstitial deletions are, therefore, much
better suited to address the problem of GCY localization.



In the course of winnowing the literature for patients with small interstitial deletions, in
particular close to the centromere, it became clear that those patients are very rare. This
prompted us to extend our search to patients carrying large de novo interstitial deletions,
irrespective of their actual adult height. We examined 9 adult patients, 7 of whom presented
normal height. Furthermore, we could show overlapping deletions, thereby excluding that GCY
resides between the Y-specific marker DYS11 and the pseudoautosomal region 2 (PAR2). Two
patients, #293 and Y0308, presented interstitial deletions enabling the restriction of the GCY
critical region to approximately 700 kb of DNA. This region is therefore predicted to harbour
one or more genes required for normal human growth.




All 9 patients studied share infertility as a common phenotype, which is in agreement with 
their large Yq deletions. Despite extensive routine screening of infertile males in reproduction 
centers, only two patients were found to present borderline/short stature in combination with a 
confirmed large de novo deletion. We therefore conclude that cytogenetically detectable de 
novo deletions enclosing the GCY gene are rare events. In addition, the adult height reduction 
of 6-8 cm attributed to the Y-specific growth gene (Ogata and Matsuo 1992) does not 
necessarily result in the diagnosis of short stature in all affected males. Sex-related adult 
height difference is determined by the level of bioactive gonadal steroids and the Y-specific 
growth gene. Parameters such as nutrition, infectious diseases and secular trend are further 
components influencing the adult height of a given individual. In particular, the mid-parental 
height contributes to the evaluation of growth reduction. 

In summary, our data localize GCY to a critical interval bounded by the Y-derived markers 
SKY8 and sY83 (DYS11). This 700 kb interval, recently sequenced by the Human Genome 
Project (Tilford et al. 2001), does not contain any known gene or any Y-specific ESTs. 
Different reasons such as unusual gene structures, e.g. genes consisting of only one exon, the 
lack of homology to any identified gene, and spatially or temporally restricted gene 
expression patterns could account for this phenomenon. 

The human genomic PAC and BAC libraries used in this work were constructed at the RPCI 
in Buffalo, NY. Clones isolated from these libraries were purchased from the same institution. 
Table 1 Adult height comparison of patients and their siblings 

Case     Country of      Height of patient (cm) and             National height  Heights of family members (cm)
         origin          standard deviation score               standard (cm)    and standard deviation score

#293     U.S.A.          157 (SDS -2.9), short                  176.9 (SD 6.8)   (F) 170; (M) normal; (B) normal
Y0308    U.S.A.          165.5 (SDS -1.7), borderline (short?)  176.9 (SD 6.8)   (F) 170; (M) 168; (B) 188 (SDS +1.7); (S) 170 (SDS -0.4)
JOLAR    United Kingdom  168 (SDS -1.0), normal                 174.7 (SD 6.7)   (F) normal; (M) normal; (B) normal
         Italy           175 (SDS -0.3), normal                 176.7 (SD 6.5)   (F) normal; (M) normal
#63      Ethiopia        170 (SDS +0.3), normal                 168.0 (SD 7.4)   (F) normal; (M) normal
ms       Israel          185 (SDS +1.4), normal                 175.6 (SD 6.8)   (F) normal; (M) normal
T.M.     Belgium         182 (SDS +1.3), normal                 173.5 (SD 6.7)   (F) normal; (M) normal
#1947    Germany         175 (SDS -0.8), normal                 179.9 (SD 6.4)   (F) normal; (M) normal
#1972    Germany         181 (SDS +0.2), normal                 179.9 (SD 6.4)   (F) 175; (M) 165; (S) 172 (SDS +1.0)



The standard deviation score (SDS) was calculated based on the equation: SDS = (X-M)/SD, where X is an 
individual's adult height and M and SD are the mean adult height and the ± 1 standard deviation of the normal 
population, respectively. 

(M) mother, (F) father, (S) sister, (B) brother. (NA) not available. 
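
As a rough illustration of the SDS equation given above, the following Python sketch recomputes the scores of four patients from the heights and national height standards listed in Table 1. The function name `sds` and the variable names are illustrative choices and are not part of the original disclosure.

```python
def sds(height_cm: float, mean_cm: float, sd_cm: float) -> float:
    """Standard deviation score: SDS = (X - M) / SD."""
    return (height_cm - mean_cm) / sd_cm


# (case, patient height X, national mean M, national SD), values from Table 1
cases = [
    ("#293", 157.0, 176.9, 6.8),
    ("Y0308", 165.5, 176.9, 6.8),
    ("JOLAR", 168.0, 174.7, 6.7),
    ("#1972", 181.0, 179.9, 6.4),
]

# Round to one decimal place, as the scores are reported in Table 1
scores = {case: round(sds(x, m, sd), 1) for case, x, m, sd in cases}

for case, score in scores.items():
    print(f"{case}: SDS {score:+.1f}")
```

Under these inputs the recomputed scores agree with the tabulated values for the four cases shown (-2.9, -1.7, -1.0 and +0.2).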




Table 2 Y-chromosomal STSs 

STS     Left Primer             Right Primer                 Product (bp)

SKY1    GGACATTTGGCTGCAGAGAT    TGGCAATGCACTCTCATCAT         255
SKY2    TCAGGACAGACAGGCTGCTA    CCTGO^CTGAQCTCCTTAC          ~1700
SKY3    TTCTCCCTCATCTTCCAAGC    GCTTCCATCCATTAGCAAGG         167
SKY4    CCTTTCATTCCATTCTCTTCCA  CGCACTTTATGGACTGCAA          111
SKY5*   CCCTCGTCCATTTCTTTTGA    CCTCGAATTTAATGGATTGC         202
SKY6*   TCAATGGATGCACAGTGTGGC   TCCACrOAATTCCATTGCAC         328
SKY7    GGOAGTGCAAAGGGAAAGAT    CTTTCX^ATGGGOTGACATTC        223
SKY8    CCATTCATTCGAGTTCATTACG  ATTGGAATGOAATCGOACAG         189
SKY9    GGCCGATGGTCAAACTGTTA    GAAACGGGCTCTGAAATTCT         531
SKY10*  ATAAGGGGCAGGTTTGTCAC    GCTACTTATTCAGTGTTTAACTGACAC  329
SKY11*  AAAGTGGGTGAAGGACATGG    TTTTTOTTTGTGGCAGOTG          469
SKY12*  TTGAGTCACTGGGQATAACTO   TATGGCCCACAATCACTTCA         216
SKY13*  GGCAGCCTAGAAAGTCTTGTTC  CCCTTGGGATTTTGTCTGTT         198



Markers indicated with a * amplify DNA fragments from more than one genomic locus (see Chapter Restriction analysis of PCR 
products for detail). 



Table 3 PCR/Restriction Digest Analysis of Sequence Family Variants in the AZFc 

STS     Restriction  BAC      Fragment sizes (bp)
        enzyme       clones   after restriction

SKY10   Tsp509I      487K20   279, 50
                     70G12    329
                     560I18   329*
SKY11   NlaIII       245K04   217, 154, 79, 19
                     506M09   233, 221, 15
SKY12   MseI         245K04   88, 57, 39, 32
                     506M09   145, 39, 32
SKY13   Cac8I/TfiI   100J21   97, 83, 23
                     589P14   175, 23
                     251M08   97, 50, 33, 23



the specific fragment amplified by the primer pair SKY10. Restriction analysis of fragments amplified from male and female genomic DNA, 
from a somatic cell hybrid line containing chromosome 1 as the only chromosome of human origin, and from the BAC RP11-560I18 as well 
shows two fragments of ~180 bp and ~155 bp, indicating a sequence error in the complete sequence of the BAC clone. 
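
The fragment sizes in Table 3 can be cross-checked against the undigested product sizes in Table 2, since the fragments produced by a complete digest should add up to the length of the amplified product. The Python sketch below performs this check for SKY10, SKY11 and SKY12; the dictionary and function names are hypothetical, and the check is offered only as a consistency test on the tabulated numbers, not as part of the original method.

```python
# Undigested PCR product sizes (bp), taken from Table 2
product_bp = {"SKY10": 329, "SKY11": 469, "SKY12": 216}

# (STS, BAC clone, fragment sizes in bp after restriction), taken from Table 3
digests = [
    ("SKY10", "487K20", [279, 50]),
    ("SKY10", "70G12", [329]),  # product left uncut by Tsp509I
    ("SKY11", "245K04", [217, 154, 79, 19]),
    ("SKY11", "506M09", [233, 221, 15]),
    ("SKY12", "245K04", [88, 57, 39, 32]),
    ("SKY12", "506M09", [145, 39, 32]),
]


def fragments_match(sts: str, fragments: list[int]) -> bool:
    """True if the digest fragments add up to the undigested product size."""
    return sum(fragments) == product_bp[sts]


results = {(sts, clone): fragments_match(sts, frags) for sts, clone, frags in digests}

for (sts, clone), ok in results.items():
    print(f"{sts} / {clone}: {'consistent' if ok else 'mismatch'}")
```

For these entries the sums equal the product sizes, so the sequence family variants differ in their cutting pattern rather than in amplicon length.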



Table 4 Summary of BAC and PAC clones identified during physical map creation 

Y-STSs   Positive BACs (RPCI-11)           Positive PACs (RPCI-1, 3-5)

sY83     not screened                      83D22
sY82     not screened                      83D22, 114A11, 157G08, 966C15
GY8      not screened                      114A11, 168E21, 271D03, 635F21,
                                           765H16, 806O15, 904E13, 966C15
sY81     not screened                      301P22, 1079J08, 1078C20, 1160A12
14A3C*   not screened                      148J07, 1136A14, 1160A12, 1196I23
sY79     75F05, 79E14, 102G24, 322K23,     1149H11
         417D23, 600D11, 612E10, 725I12,
         863I08, 903M02, 1125H21
SKY1     376B16, 544C11, 544M21            56A05, 85D24, 958M03
SKY2     79P12, 295P22, 376L20, 828O24,    829H08
         886I11, 910C06
SKY4     75F05, 322K23, 612E10             not screened
SKY5     174I24, 271E18, 295P22, 588E18,   not screened
         620J20, 632F11, 684H19, 705O19
SKY6     174I24, 271E18, 295P22, 588E18,   not screened
         620J20, 632F11, 684H19, 705O19



* 14A3C is a hybridization probe previously described by Tyler-Smith et al. 1993. It detects a Y-specific 
HindIII fragment of 3.5 kb and an additional autosomal fragment. 
Table 7B Primer pairs for Y copy of Adlican 

Primer sequence (5'->3')      Direction with respect to putative  Position     Primer
                              transcription orientation

OAC 1 cc 1 uGCC 1 1 uAcrruA   forward                             45-64        AdlYEx1
TCTCTGTGGTGCTGATCCTG          forward                             185-204      cfl
GGAGGAGCAAAAACAAGAAGAGA       forward                             514-536
ACTGATGAGCACGGGAACC           forward                             602-620      cfl-205
TCCATCCTGAAAGTGCCTG           forward                             1190-1208    C17c
ACATGTATACATGCTGCCAA          forward                             1513-1532    C18
CAGCGAAGGAAAGCACATTT          forward                             2178-2197    AdlYExS
GGCGACCTGAAGGGGACT            forward                             2341-2358    cfl-1915
CTQTCCAGTCCTCAGGAAGC          forward                             5090-5109    C21
GAAGCATCCACCAAAGCG            forward                             5105-5122    cfl-4679
ACAGCGGGCQCTATGAGT            forward                             5972-5989    cfl-4a
CAGGATCAGCACCACAGAGA          reverse                             204-185      AdlYEx2
CTGGGGAAQTTGGATTTTCTC         reverse                             580-560      C17b
ACCAGGTTCCCGTGCTCA            reverse                             624-607      cfl-227
GCAAGAATCTGGGCTCTCAC          reverse                             915-896      cfl
ACTGTGATTCCCACCGTGAT          reverse                             1638-1619    C17c
TTGTTTTGAGGAACGCCTCT          reverse                             1813-1794    C18
GGATGTGGGATCTGGTGAG           reverse                             2505-2487    cfl-2079
GGGTGTAATTTTCTCCCATTG         reverse                             3104-3084    AdlYExS
COTCCGTTTCAGCAGTGACA          reverse                             5236-5217    cfl-4810
CTGACGTCCGTCCTCTGC            reverse                             6144-6127    cfl-4b
ATGGACAGTGATCCGGTTTC          reverse                             7159-7140    cfl-6453
TGAGCTGCACGATCAACCTC          reverse                             7265-7246    cfl-6559
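
The relationship between forward and reverse primers in Table 7B can be illustrated with a short Python sketch. It assumes that the positions in the table are 1-based coordinates on one common reference sequence (this is an assumption; the coordinate system is not stated here), in which case a reverse primer should equal the reverse complement of the top-strand sequence at its position. The helper name `reverse_complement` and the variable names are illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Forward primer at 185-204 and reverse primer at 204-185 cover the same positions
fwd_185_204 = "TCTCTGTGGTGCTGATCCTG"
rev_204_185 = "CAGGATCAGCACCACAGAGA"
same_site = reverse_complement(fwd_185_204) == rev_204_185

# Forward primer at 602-620 and reverse primer at 624-607 overlap at 607-620
fwd_602_620 = "ACTGATGAGCACGGGAACC"
rev_624_607 = "ACCAGGTTCCCGTGCTCA"
top_607_624 = reverse_complement(rev_624_607)  # top-strand sequence of 607-624
overlap_607_620 = fwd_602_620[607 - 602:]      # forward primer bases 607-620
shares_overlap = top_607_624.startswith(overlap_607_620)

print(same_site, shares_overlap)
```

Both checks hold for the sequences as listed, which supports reading the position column as coordinates on a single reference under the stated assumption.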